Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 7146 |
| Missing cells | 8168 |
| Missing cells (%) | 5.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.3 MiB |
| Average record size in memory | 491.1 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 3 |
| Text | 3 |
| DateTime | 1 |
PropType is highly imbalanced (62.8%) | Imbalance |
Hbath is highly imbalanced (51.5%) | Imbalance |
CondoProject has 6261 (87.6%) missing values | Missing |
Extwall has 926 (13.0%) missing values | Missing |
Rooms has 443 (6.2%) missing values | Missing |
Bdrms has 443 (6.2%) missing values | Missing |
Units is highly skewed (γ1 = 42.80622276) | Skewed |
Lotsize is highly skewed (γ1 = 33.88617801) | Skewed |
Rooms has 122 (1.7%) zeros | Zeros |
Fbath has 509 (7.1%) zeros | Zeros |
Lotsize has 489 (6.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-28 20:26:13.347981 |
|---|---|
| Analysis finished | 2024-02-28 20:26:28.923777 |
| Duration | 15.58 seconds |
| Software version | ydata-profiling vv4.6.5 |
| Download configuration | config.json |
PropertyID
Real number (ℝ)
| Distinct | 7055 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 178756.77 |
| Minimum | 98461 |
|---|---|
| Maximum | 266040 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 98461 |
|---|---|
| 5-th percentile | 105790.25 |
| Q1 | 136233.25 |
| median | 176670.5 |
| Q3 | 221564.25 |
| 95-th percentile | 253672.25 |
| Maximum | 266040 |
| Range | 167579 |
| Interquartile range (IQR) | 85331 |
Descriptive statistics
| Standard deviation | 47982.982 |
|---|---|
| Coefficient of variation (CV) | 0.26842609 |
| Kurtosis | -1.2318058 |
| Mean | 178756.77 |
| Median Absolute Deviation (MAD) | 42517 |
| Skewness | 0.059076039 |
| Sum | 1.2773959 × 109 |
| Variance | 2.3023665 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 176566 | 2 | < 0.1% |
| 183128 | 2 | < 0.1% |
| 236196 | 2 | < 0.1% |
| 214075 | 2 | < 0.1% |
| 172832 | 2 | < 0.1% |
| 141917 | 2 | < 0.1% |
| 114543 | 2 | < 0.1% |
| 141962 | 2 | < 0.1% |
| 213578 | 2 | < 0.1% |
| 213568 | 2 | < 0.1% |
| Other values (7045) | 7126 |
| Value | Count | Frequency (%) |
| 98461 | 1 | |
| 98464 | 1 | |
| 98508 | 1 | |
| 98519 | 1 | |
| 98561 | 1 | |
| 98593 | 1 | |
| 98604 | 1 | |
| 98608 | 1 | |
| 98696 | 1 | |
| 98715 | 1 |
| Value | Count | Frequency (%) |
| 266040 | 1 | |
| 266025 | 1 | |
| 266017 | 1 | |
| 266009 | 1 | |
| 265996 | 1 | |
| 265962 | 1 | |
| 265958 | 1 | |
| 265953 | 1 | |
| 265945 | 1 | |
| 265842 | 1 |
PropType
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 474.7 KiB |
| Residential | |
|---|---|
| Condominium | |
| Commercial | 240 |
| Lg Apartment | 238 |
| Manufacturing | 6 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 11.0007 |
| Min length | 6 |
Characters and Unicode
| Total characters | 78611 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Manufacturing |
|---|---|
| 2nd row | Commercial |
| 3rd row | Residential |
| 4th row | Residential |
| 5th row | Residential |
Common Values
| Value | Count | Frequency (%) |
| Residential | 5774 | |
| Condominium | 887 | 12.4% |
| Commercial | 240 | 3.4% |
| Lg Apartment | 238 | 3.3% |
| Manufacturing | 6 | 0.1% |
| Exempt | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| residential | 5774 | |
| condominium | 887 | 12.0% |
| commercial | 240 | 3.3% |
| lg | 238 | 3.2% |
| apartment | 238 | 3.2% |
| manufacturing | 6 | 0.1% |
| exempt | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 13568 | |
| e | 12027 | |
| n | 7798 | |
| d | 6661 | |
| a | 6264 | |
| t | 6257 | |
| l | 6014 | |
| R | 5774 | |
| s | 5774 | |
| m | 2493 | 3.2% |
| Other values (14) | 5981 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70989 | |
| Uppercase Letter | 7384 | 9.4% |
| Space Separator | 238 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 13568 | |
| e | 12027 | |
| n | 7798 | |
| d | 6661 | |
| a | 6264 | |
| t | 6257 | |
| l | 6014 | |
| s | 5774 | |
| m | 2493 | 3.5% |
| o | 2014 | 2.8% |
| Other values (7) | 2119 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 5774 | |
| C | 1127 | 15.3% |
| L | 238 | 3.2% |
| A | 238 | 3.2% |
| M | 6 | 0.1% |
| E | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 238 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 78373 | |
| Common | 238 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 13568 | |
| e | 12027 | |
| n | 7798 | |
| d | 6661 | |
| a | 6264 | |
| t | 6257 | |
| l | 6014 | |
| R | 5774 | |
| s | 5774 | |
| m | 2493 | 3.2% |
| Other values (13) | 5743 |
Common
| Value | Count | Frequency (%) |
| 238 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78611 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 13568 | |
| e | 12027 | |
| n | 7798 | |
| d | 6661 | |
| a | 6264 | |
| t | 6257 | |
| l | 6014 | |
| R | 5774 | |
| s | 5774 | |
| m | 2493 | 3.2% |
| Other values (14) | 5981 |
taxkey
Real number (ℝ)
| Distinct | 7055 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4687434 × 109 |
| Minimum | 30131000 |
|---|---|
| Maximum | 7.160375 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 30131000 |
|---|---|
| 5-th percentile | 1.1701463 × 109 |
| Q1 | 2.3110052 × 109 |
| median | 3.211215 × 109 |
| Q3 | 4.703136 × 109 |
| 95-th percentile | 5.8008152 × 109 |
| Maximum | 7.160375 × 109 |
| Range | 7.130244 × 109 |
| Interquartile range (IQR) | 2.3921308 × 109 |
Descriptive statistics
| Standard deviation | 1.4845672 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.42798416 |
| Kurtosis | -0.69460439 |
| Mean | 3.4687434 × 109 |
| Median Absolute Deviation (MAD) | 1.0810275 × 109 |
| Skewness | 0.155261 |
| Sum | 2.478764 × 1013 |
| Variance | 2.2039399 × 1018 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 3211565100 | 2 | < 0.1% |
| 3460532000 | 2 | < 0.1% |
| 5290638000 | 2 | < 0.1% |
| 4591388000 | 2 | < 0.1% |
| 3141167100 | 2 | < 0.1% |
| 2550046000 | 2 | < 0.1% |
| 1730015000 | 2 | < 0.1% |
| 2550103000 | 2 | < 0.1% |
| 4590325000 | 2 | < 0.1% |
| 4590315000 | 2 | < 0.1% |
| Other values (7045) | 7126 |
| Value | Count | Frequency (%) |
| 30131000 | 1 | |
| 30152000 | 1 | |
| 49980110 | 1 | |
| 49993200 | 1 | |
| 50042000 | 1 | |
| 50074000 | 1 | |
| 50085000 | 1 | |
| 50089000 | 1 | |
| 70017000 | 1 | |
| 70036000 | 1 |
| Value | Count | Frequency (%) |
| 7160375000 | 1 | |
| 7160366000 | 1 | |
| 7160365000 | 1 | |
| 7160351000 | 1 | |
| 7160339000 | 1 | |
| 7160327000 | 1 | |
| 7160283000 | 1 | |
| 7160279000 | 1 | |
| 7160254000 | 1 | |
| 7160241000 | 1 |
Address
Text
| Distinct | 7055 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 517.2 KiB |
Length
| Max length | 37 |
|---|---|
| Median length | 31 |
| Mean length | 17.094318 |
| Min length | 12 |
Characters and Unicode
| Total characters | 122156 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6964 ? |
|---|---|
| Unique (%) | 97.5% |
Sample
| 1st row | 9434-9446 N 107TH ST |
|---|---|
| 2nd row | 9306-9316 N 107TH ST |
| 3rd row | 9327 N SWAN RD |
| 4th row | 9411 W COUNTY LINE RD |
| 5th row | 9322 N JOYCE AV |
| Value | Count | Frequency (%) |
| st | 4604 | 15.2% |
| n | 3433 | 11.3% |
| w | 1699 | 5.6% |
| av | 1661 | 5.5% |
| s | 1596 | 5.3% |
| unit | 771 | 2.5% |
| e | 431 | 1.4% |
| pl | 285 | 0.9% |
| dr | 180 | 0.6% |
| rd | 122 | 0.4% |
| Other values (5383) | 15604 |
Most occurring characters
| Value | Count | Frequency (%) |
| 23240 | ||
| T | 9224 | 7.6% |
| S | 7467 | 6.1% |
| 2 | 5877 | 4.8% |
| 1 | 5623 | 4.6% |
| N | 5617 | 4.6% |
| 3 | 5300 | 4.3% |
| 4 | 4239 | 3.5% |
| 5 | 4130 | 3.4% |
| 0 | 4084 | 3.3% |
| Other values (45) | 47355 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 53509 | |
| Decimal Number | 41100 | |
| Space Separator | 23240 | |
| Lowercase Letter | 2337 | 1.9% |
| Dash Punctuation | 1196 | 1.0% |
| Other Punctuation | 774 | 0.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 9224 | |
| S | 7467 | |
| N | 5617 | |
| A | 3975 | 7.4% |
| H | 3731 | 7.0% |
| E | 3127 | 5.8% |
| R | 2733 | 5.1% |
| W | 2303 | 4.3% |
| L | 2210 | 4.1% |
| V | 1972 | 3.7% |
| Other values (16) | 11150 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 771 | |
| n | 771 | |
| i | 771 | |
| d | 6 | 0.3% |
| f | 4 | 0.2% |
| j | 3 | 0.1% |
| b | 2 | 0.1% |
| k | 2 | 0.1% |
| a | 2 | 0.1% |
| m | 1 | < 0.1% |
| Other values (4) | 4 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5877 | |
| 1 | 5623 | |
| 3 | 5300 | |
| 4 | 4239 | |
| 5 | 4130 | |
| 0 | 4084 | |
| 6 | 3307 | |
| 7 | 3085 | |
| 8 | 2863 | |
| 9 | 2592 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 771 | |
| # | 2 | 0.3% |
| \ | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 23240 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1196 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 66310 | |
| Latin | 55846 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 9224 | |
| S | 7467 | |
| N | 5617 | |
| A | 3975 | 7.1% |
| H | 3731 | 6.7% |
| E | 3127 | 5.6% |
| R | 2733 | 4.9% |
| W | 2303 | 4.1% |
| L | 2210 | 4.0% |
| V | 1972 | 3.5% |
| Other values (30) | 13487 |
Common
| Value | Count | Frequency (%) |
| 23240 | ||
| 2 | 5877 | 8.9% |
| 1 | 5623 | 8.5% |
| 3 | 5300 | 8.0% |
| 4 | 4239 | 6.4% |
| 5 | 4130 | 6.2% |
| 0 | 4084 | 6.2% |
| 6 | 3307 | 5.0% |
| 7 | 3085 | 4.7% |
| 8 | 2863 | 4.3% |
| Other values (5) | 4562 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 122156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 23240 | ||
| T | 9224 | 7.6% |
| S | 7467 | 6.1% |
| 2 | 5877 | 4.8% |
| 1 | 5623 | 4.6% |
| N | 5617 | 4.6% |
| 3 | 5300 | 4.3% |
| 4 | 4239 | 3.5% |
| 5 | 4130 | 3.4% |
| 0 | 4084 | 3.3% |
| Other values (45) | 47355 |
CondoProject
Text
MISSING 
| Distinct | 202 |
|---|---|
| Distinct (%) | 22.8% |
| Missing | 6261 |
| Missing (%) | 87.6% |
| Memory size | 260.0 KiB |
Length
| Max length | 35 |
|---|---|
| Median length | 26 |
| Mean length | 17.267797 |
| Min length | 5 |
Characters and Unicode
| Total characters | 15282 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 72 ? |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | NORTHRIDGE WOOD LAKE |
|---|---|
| 2nd row | NORTHRIDGE WOOD LAKE |
| 3rd row | NORTHRIDGE WOOD LAKE |
| 4th row | NORTHRIDGE WOOD LAKE |
| 5th row | NORTHRIDGE WOOD LAKE |
| Value | Count | Frequency (%) |
| condominium | 100 | 4.6% |
| on | 89 | 4.1% |
| lofts | 77 | 3.5% |
| the | 70 | 3.2% |
| lake | 66 | 3.0% |
| river | 58 | 2.7% |
| condos | 55 | 2.5% |
| condominiums | 55 | 2.5% |
| terrace | 39 | 1.8% |
| point | 32 | 1.5% |
| Other values (250) | 1538 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1391 | 9.1% |
| 1353 | 8.9% | |
| O | 1352 | 8.8% |
| R | 1153 | 7.5% |
| N | 1055 | 6.9% |
| I | 1006 | 6.6% |
| A | 960 | 6.3% |
| T | 796 | 5.2% |
| S | 795 | 5.2% |
| L | 757 | 5.0% |
| Other values (37) | 4664 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 13616 | |
| Space Separator | 1353 | 8.9% |
| Decimal Number | 128 | 0.8% |
| Close Punctuation | 65 | 0.4% |
| Open Punctuation | 65 | 0.4% |
| Dash Punctuation | 39 | 0.3% |
| Other Punctuation | 9 | 0.1% |
| Lowercase Letter | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1391 | 10.2% |
| O | 1352 | 9.9% |
| R | 1153 | 8.5% |
| N | 1055 | 7.7% |
| I | 1006 | 7.4% |
| A | 960 | 7.1% |
| T | 796 | 5.8% |
| S | 795 | 5.8% |
| L | 757 | 5.6% |
| D | 599 | 4.4% |
| Other values (16) | 3752 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 37 | |
| 2 | 30 | |
| 0 | 20 | |
| 5 | 18 | |
| 6 | 18 | |
| 3 | 2 | 1.6% |
| 4 | 1 | 0.8% |
| 8 | 1 | 0.8% |
| 7 | 1 | 0.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2 | |
| a | 2 | |
| n | 1 | |
| l | 1 | |
| c | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 4 | |
| / | 3 | |
| ' | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1353 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 65 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 65 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 39 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13623 | |
| Common | 1659 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1391 | 10.2% |
| O | 1352 | 9.9% |
| R | 1153 | 8.5% |
| N | 1055 | 7.7% |
| I | 1006 | 7.4% |
| A | 960 | 7.0% |
| T | 796 | 5.8% |
| S | 795 | 5.8% |
| L | 757 | 5.6% |
| D | 599 | 4.4% |
| Other values (21) | 3759 |
Common
| Value | Count | Frequency (%) |
| 1353 | ||
| ) | 65 | 3.9% |
| ( | 65 | 3.9% |
| - | 39 | 2.4% |
| 1 | 37 | 2.2% |
| 2 | 30 | 1.8% |
| 0 | 20 | 1.2% |
| 5 | 18 | 1.1% |
| 6 | 18 | 1.1% |
| & | 4 | 0.2% |
| Other values (6) | 10 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15282 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1391 | 9.1% |
| 1353 | 8.9% | |
| O | 1352 | 8.8% |
| R | 1153 | 7.5% |
| N | 1055 | 6.9% |
| I | 1006 | 6.6% |
| A | 960 | 6.3% |
| T | 796 | 5.2% |
| S | 795 | 5.2% |
| L | 757 | 5.0% |
| Other values (37) | 4664 |
District
Real number (ℝ)
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8376714 |
| Minimum | 1 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 14 |
| Maximum | 15 |
| Range | 14 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.2621201 |
|---|---|
| Coefficient of variation (CV) | 0.54379929 |
| Kurtosis | -1.2297748 |
| Mean | 7.8376714 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.01487817 |
| Sum | 56008 |
| Variance | 18.165668 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 778 | |
| 11 | 612 | 8.6% |
| 10 | 586 | 8.2% |
| 14 | 562 | 7.9% |
| 2 | 531 | 7.4% |
| 7 | 530 | 7.4% |
| 13 | 524 | 7.3% |
| 3 | 513 | 7.2% |
| 9 | 488 | 6.8% |
| 1 | 468 | 6.5% |
| Other values (5) | 1554 |
| Value | Count | Frequency (%) |
| 1 | 468 | |
| 2 | 531 | |
| 3 | 513 | |
| 4 | 322 | |
| 5 | 778 | |
| 6 | 366 | |
| 7 | 530 | |
| 8 | 273 | 3.8% |
| 9 | 488 | |
| 10 | 586 |
| Value | Count | Frequency (%) |
| 15 | 297 | |
| 14 | 562 | |
| 13 | 524 | |
| 12 | 296 | |
| 11 | 612 | |
| 10 | 586 | |
| 9 | 488 | |
| 8 | 273 | |
| 7 | 530 | |
| 6 | 366 |
nbhd
Real number (ℝ)
| Distinct | 459 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3338.4563 |
| Minimum | 40 |
|---|---|
| Maximum | 24910 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 40 |
|---|---|
| 5-th percentile | 780 |
| Q1 | 1780 |
| median | 3060 |
| Q3 | 4620 |
| 95-th percentile | 6277 |
| Maximum | 24910 |
| Range | 24870 |
| Interquartile range (IQR) | 2840 |
Descriptive statistics
| Standard deviation | 1795.1757 |
|---|---|
| Coefficient of variation (CV) | 0.53772626 |
| Kurtosis | 1.7286574 |
| Mean | 3338.4563 |
| Median Absolute Deviation (MAD) | 1520 |
| Skewness | 0.36972966 |
| Sum | 23856609 |
| Variance | 3222655.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2100 | 156 | 2.2% |
| 2080 | 132 | 1.8% |
| 4520 | 124 | 1.7% |
| 4120 | 119 | 1.7% |
| 1140 | 118 | 1.7% |
| 4420 | 101 | 1.4% |
| 4240 | 99 | 1.4% |
| 4340 | 98 | 1.4% |
| 4620 | 93 | 1.3% |
| 1440 | 92 | 1.3% |
| Other values (449) | 6014 |
| Value | Count | Frequency (%) |
| 40 | 14 | 0.2% |
| 50 | 4 | 0.1% |
| 240 | 61 | |
| 360 | 31 | |
| 380 | 14 | 0.2% |
| 440 | 43 | |
| 480 | 71 | |
| 520 | 8 | 0.1% |
| 560 | 41 | |
| 600 | 22 | 0.3% |
| Value | Count | Frequency (%) |
| 24910 | 1 | < 0.1% |
| 6982 | 1 | < 0.1% |
| 6981 | 1 | < 0.1% |
| 6980 | 1 | < 0.1% |
| 6979 | 1 | < 0.1% |
| 6978 | 1 | < 0.1% |
| 6977 | 2 | |
| 6976 | 1 | < 0.1% |
| 6974 | 4 | |
| 6973 | 1 | < 0.1% |
Style
Text
| Distinct | 81 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 21 |
| Missing (%) | 0.3% |
| Memory size | 484.5 KiB |
Length
| Max length | 50 |
|---|---|
| Median length | 48 |
| Mean length | 12.516491 |
| Min length | 5 |
Characters and Unicode
| Total characters | 89180 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Service Building |
|---|---|
| 2nd row | Office Building - 1 Story |
| 3rd row | Ranch |
| 4th row | Ranch |
| 5th row | Ranch |
| Value | Count | Frequency (%) |
| ranch | 1507 | 8.8% |
| o/s | 1174 | 6.8% |
| cod | 1006 | 5.9% |
| cape | 1006 | 5.9% |
| 999 | 5.8% | |
| duplex | 954 | 5.6% |
| bungalow | 871 | 5.1% |
| rise | 681 | 4.0% |
| stories | 681 | 4.0% |
| res | 628 | 3.7% |
| Other values (146) | 7656 |
Most occurring characters
| Value | Count | Frequency (%) |
| 10040 | 11.3% | |
| e | 6566 | 7.4% |
| o | 5544 | 6.2% |
| a | 4859 | 5.4% |
| n | 4344 | 4.9% |
| l | 4294 | 4.8% |
| i | 3979 | 4.5% |
| t | 3052 | 3.4% |
| C | 3005 | 3.4% |
| p | 2981 | 3.3% |
| Other values (53) | 40516 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54926 | |
| Uppercase Letter | 16158 | 18.1% |
| Space Separator | 10040 | 11.3% |
| Decimal Number | 3547 | 4.0% |
| Other Punctuation | 2687 | 3.0% |
| Dash Punctuation | 1017 | 1.1% |
| Open Punctuation | 303 | 0.3% |
| Math Symbol | 260 | 0.3% |
| Close Punctuation | 242 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6566 | |
| o | 5544 | |
| a | 4859 | 8.8% |
| n | 4344 | 7.9% |
| l | 4294 | 7.8% |
| i | 3979 | 7.2% |
| t | 3052 | 5.6% |
| p | 2981 | 5.4% |
| u | 2972 | 5.4% |
| s | 2913 | 5.3% |
| Other values (15) | 13422 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3005 | |
| R | 2926 | |
| S | 2678 | |
| D | 1450 | |
| O | 1283 | |
| B | 1115 | 6.9% |
| A | 993 | 6.1% |
| M | 988 | 6.1% |
| T | 387 | 2.4% |
| N | 384 | 2.4% |
| Other values (10) | 949 | 5.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1449 | |
| 2 | 1114 | |
| 4 | 509 | 14.4% |
| 3 | 257 | 7.2% |
| 6 | 161 | 4.5% |
| 7 | 41 | 1.2% |
| 0 | 13 | 0.4% |
| 8 | 3 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2002 | |
| & | 589 | 21.9% |
| , | 88 | 3.3% |
| . | 8 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 162 | |
| > | 98 |
Space Separator
| Value | Count | Frequency (%) |
| 10040 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1017 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 303 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 242 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 71084 | |
| Common | 18096 | 20.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6566 | 9.2% |
| o | 5544 | 7.8% |
| a | 4859 | 6.8% |
| n | 4344 | 6.1% |
| l | 4294 | 6.0% |
| i | 3979 | 5.6% |
| t | 3052 | 4.3% |
| C | 3005 | 4.2% |
| p | 2981 | 4.2% |
| u | 2972 | 4.2% |
| Other values (35) | 29488 |
Common
| Value | Count | Frequency (%) |
| 10040 | ||
| / | 2002 | 11.1% |
| 1 | 1449 | 8.0% |
| 2 | 1114 | 6.2% |
| - | 1017 | 5.6% |
| & | 589 | 3.3% |
| 4 | 509 | 2.8% |
| ( | 303 | 1.7% |
| 3 | 257 | 1.4% |
| ) | 242 | 1.3% |
| Other values (8) | 574 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 89180 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 10040 | 11.3% | |
| e | 6566 | 7.4% |
| o | 5544 | 6.2% |
| a | 4859 | 5.4% |
| n | 4344 | 4.9% |
| l | 4294 | 4.8% |
| i | 3979 | 4.5% |
| t | 3052 | 3.4% |
| C | 3005 | 3.4% |
| p | 2981 | 3.3% |
| Other values (53) | 40516 |
Extwall
Categorical
MISSING 
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 926 |
| Missing (%) | 13.0% |
| Memory size | 471.1 KiB |
| Aluminum/Vinyl | |
|---|---|
| Brick | |
| Wood | 331 |
| Asphalt/Other | 313 |
| Stone | 179 |
| Other values (13) |
Length
| Max length | 23 |
|---|---|
| Median length | 14 |
| Mean length | 11.009646 |
| Min length | 4 |
Characters and Unicode
| Total characters | 68480 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Concrete Block |
|---|---|
| 2nd row | Brick |
| 3rd row | Aluminum/Vinyl |
| 4th row | Aluminum/Vinyl |
| 5th row | Aluminum/Vinyl |
Common Values
| Value | Count | Frequency (%) |
| Aluminum/Vinyl | 3468 | |
| Brick | 1408 | |
| Wood | 331 | 4.6% |
| Asphalt/Other | 313 | 4.4% |
| Stone | 179 | 2.5% |
| Masonry/Frame | 154 | 2.2% |
| Stucco | 93 | 1.3% |
| Concrete Block | 77 | 1.1% |
| Fiber Cement/Hardiplank | 49 | 0.7% |
| Alum/Vynyl Siding | 46 | 0.6% |
| Other values (8) | 102 | 1.4% |
| (Missing) | 926 | 13.0% |
Length
| Value | Count | Frequency (%) |
| aluminum/vinyl | 3468 | |
| brick | 1410 | |
| wood | 342 | 5.3% |
| asphalt/other | 313 | 4.8% |
| stone | 179 | 2.8% |
| masonry/frame | 154 | 2.4% |
| block | 104 | 1.6% |
| stucco | 93 | 1.4% |
| concrete | 77 | 1.2% |
| siding | 58 | 0.9% |
| Other values (10) | 262 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 8560 | |
| n | 7591 | |
| l | 7506 | |
| m | 7224 | |
| u | 7075 | |
| / | 4030 | 5.9% |
| A | 3827 | 5.6% |
| y | 3755 | 5.5% |
| V | 3514 | 5.1% |
| r | 2310 | 3.4% |
| Other values (22) | 13088 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53722 | |
| Uppercase Letter | 10488 | 15.3% |
| Other Punctuation | 4030 | 5.9% |
| Space Separator | 240 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 8560 | |
| n | 7591 | |
| l | 7506 | |
| m | 7224 | |
| u | 7075 | |
| y | 3755 | |
| r | 2310 | 4.3% |
| c | 1791 | 3.3% |
| k | 1563 | 2.9% |
| o | 1334 | 2.5% |
| Other values (9) | 5013 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3827 | |
| V | 3514 | |
| B | 1514 | 14.4% |
| W | 342 | 3.3% |
| S | 330 | 3.1% |
| O | 323 | 3.1% |
| F | 231 | 2.2% |
| M | 207 | 2.0% |
| C | 126 | 1.2% |
| H | 49 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 4030 |
Space Separator
| Value | Count | Frequency (%) |
| 240 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64210 | |
| Common | 4270 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 8560 | |
| n | 7591 | |
| l | 7506 | |
| m | 7224 | |
| u | 7075 | |
| A | 3827 | 6.0% |
| y | 3755 | 5.8% |
| V | 3514 | 5.5% |
| r | 2310 | 3.6% |
| c | 1791 | 2.8% |
| Other values (20) | 11057 |
Common
| Value | Count | Frequency (%) |
| / | 4030 | |
| 240 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68480 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 8560 | |
| n | 7591 | |
| l | 7506 | |
| m | 7224 | |
| u | 7075 | |
| / | 4030 | 5.9% |
| A | 3827 | 5.6% |
| y | 3755 | 5.5% |
| V | 3514 | 5.1% |
| r | 2310 | 3.4% |
| Other values (22) | 13088 |
Stories
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 39 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3848319 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 21 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5348119 |
|---|---|
| Coefficient of variation (CV) | 0.38619266 |
| Kurtosis | 69.638741 |
| Mean | 1.3848319 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.044354 |
| Sum | 9842 |
| Variance | 0.28602376 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3955 | |
| 2 | 2008 | |
| 1.5 | 1015 | 14.2% |
| 3 | 52 | 0.7% |
| 2.5 | 40 | 0.6% |
| 0 | 21 | 0.3% |
| 4 | 7 | 0.1% |
| 5 | 3 | < 0.1% |
| 7 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
| (Missing) | 39 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 21 | 0.3% |
| 1 | 3955 | |
| 1.5 | 1015 | 14.2% |
| 2 | 2008 | |
| 2.5 | 40 | 0.6% |
| 3 | 52 | 0.7% |
| 3.5 | 1 | < 0.1% |
| 4 | 7 | 0.1% |
| 5 | 3 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 7 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 3 | < 0.1% |
| 4 | 7 | 0.1% |
| 3.5 | 1 | < 0.1% |
| 3 | 52 | 0.7% |
| 2.5 | 40 | 0.6% |
| 2 | 2008 |
Year_Built
Real number (ℝ)
| Distinct | 155 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 11 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1936.1706 |
| Minimum | 0 |
|---|---|
| Maximum | 2022 |
| Zeros | 20 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1892 |
| Q1 | 1921 |
| median | 1948 |
| Q3 | 1958 |
| 95-th percentile | 1999 |
| Maximum | 2022 |
| Range | 2022 |
| Interquartile range (IQR) | 37 |
Descriptive statistics
| Standard deviation | 106.7051 |
|---|---|
| Coefficient of variation (CV) | 0.055111416 |
| Kurtosis | 301.17114 |
| Mean | 1936.1706 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -16.740934 |
| Sum | 13814577 |
| Variance | 11385.979 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1955 | 249 | 3.5% |
| 1952 | 212 | 3.0% |
| 1951 | 196 | 2.7% |
| 1954 | 195 | 2.7% |
| 1953 | 193 | 2.7% |
| 1956 | 183 | 2.6% |
| 1950 | 178 | 2.5% |
| 1957 | 177 | 2.5% |
| 1958 | 168 | 2.4% |
| 1926 | 138 | 1.9% |
| Other values (145) | 5246 |
| Value | Count | Frequency (%) |
| 0 | 20 | |
| 1836 | 1 | < 0.1% |
| 1843 | 1 | < 0.1% |
| 1855 | 2 | < 0.1% |
| 1860 | 2 | < 0.1% |
| 1861 | 2 | < 0.1% |
| 1865 | 3 | < 0.1% |
| 1868 | 1 | < 0.1% |
| 1869 | 1 | < 0.1% |
| 1870 | 15 |
| Value | Count | Frequency (%) |
| 2022 | 6 | |
| 2020 | 1 | < 0.1% |
| 2019 | 1 | < 0.1% |
| 2018 | 3 | |
| 2017 | 2 | < 0.1% |
| 2016 | 4 | |
| 2015 | 1 | < 0.1% |
| 2014 | 2 | < 0.1% |
| 2013 | 3 | |
| 2012 | 1 | < 0.1% |
Rooms
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 443 |
| Missing (%) | 6.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.7193794 |
| Minimum | 0 |
|---|---|
| Maximum | 63 |
| Zeros | 122 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 14.9 |
| Maximum | 63 |
| Range | 63 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.1556761 |
|---|---|
| Coefficient of variation (CV) | 0.53834329 |
| Kurtosis | 15.325323 |
| Mean | 7.7193794 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.4306361 |
| Sum | 51743 |
| Variance | 17.269644 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 1427 | |
| 6 | 991 | |
| 10 | 905 | |
| 4 | 644 | |
| 8 | 573 | |
| 7 | 512 | 7.2% |
| 12 | 410 | 5.7% |
| 9 | 305 | 4.3% |
| 14 | 141 | 2.0% |
| 11 | 137 | 1.9% |
| Other values (30) | 658 | |
| (Missing) | 443 | 6.2% |
| Value | Count | Frequency (%) |
| 0 | 122 | 1.7% |
| 1 | 6 | 0.1% |
| 2 | 21 | 0.3% |
| 3 | 128 | 1.8% |
| 4 | 644 | |
| 5 | 1427 | |
| 6 | 991 | |
| 7 | 512 | 7.2% |
| 8 | 573 | |
| 9 | 305 | 4.3% |
| Value | Count | Frequency (%) |
| 63 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 2 | < 0.1% |
| 30 | 8 |
FinishedSqft
Real number (ℝ)
| Distinct | 2386 |
|---|---|
| Distinct (%) | 33.5% |
| Missing | 24 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2334.2627 |
| Minimum | 0 |
|---|---|
| Maximum | 245266 |
| Zeros | 6 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 787 |
| Q1 | 1082 |
| median | 1402.5 |
| Q3 | 2014 |
| 95-th percentile | 3672 |
| Maximum | 245266 |
| Range | 245266 |
| Interquartile range (IQR) | 932 |
Descriptive statistics
| Standard deviation | 8425.9847 |
|---|---|
| Coefficient of variation (CV) | 3.6096986 |
| Kurtosis | 406.49186 |
| Mean | 2334.2627 |
| Median Absolute Deviation (MAD) | 397.5 |
| Skewness | 18.519566 |
| Sum | 16624619 |
| Variance | 70997219 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1056 | 30 | 0.4% |
| 936 | 30 | 0.4% |
| 864 | 29 | 0.4% |
| 1120 | 26 | 0.4% |
| 672 | 26 | 0.4% |
| 1054 | 25 | 0.3% |
| 1140 | 23 | 0.3% |
| 2040 | 22 | 0.3% |
| 980 | 21 | 0.3% |
| 912 | 18 | 0.3% |
| Other values (2376) | 6872 | |
| (Missing) | 24 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 6 | |
| 325 | 4 | |
| 405 | 2 | < 0.1% |
| 430 | 2 | < 0.1% |
| 460 | 3 | |
| 476 | 1 | < 0.1% |
| 496 | 1 | < 0.1% |
| 498 | 2 | < 0.1% |
| 500 | 1 | < 0.1% |
| 508 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 245266 | 1 | |
| 232960 | 1 | |
| 210744 | 1 | |
| 202568 | 1 | |
| 196753 | 1 | |
| 170090 | 1 | |
| 156025 | 1 | |
| 141787 | 1 | |
| 139280 | 1 | |
| 127812 | 1 |
Units
Real number (ℝ)
SKEWED 
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0100756 |
| Minimum | 0 |
|---|---|
| Maximum | 737 |
| Zeros | 29 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 737 |
| Range | 737 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 14.166496 |
|---|---|
| Coefficient of variation (CV) | 7.047743 |
| Kurtosis | 2070.7177 |
| Mean | 2.0100756 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 42.806223 |
| Sum | 14364 |
| Variance | 200.68961 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5070 | |
| 2 | 1564 | 21.9% |
| 4 | 166 | 2.3% |
| 3 | 135 | 1.9% |
| 8 | 40 | 0.6% |
| 0 | 29 | 0.4% |
| 6 | 28 | 0.4% |
| 5 | 25 | 0.3% |
| 7 | 16 | 0.2% |
| 12 | 8 | 0.1% |
| Other values (39) | 65 | 0.9% |
| Value | Count | Frequency (%) |
| 0 | 29 | 0.4% |
| 1 | 5070 | |
| 2 | 1564 | 21.9% |
| 3 | 135 | 1.9% |
| 4 | 166 | 2.3% |
| 5 | 25 | 0.3% |
| 6 | 28 | 0.4% |
| 7 | 16 | 0.2% |
| 8 | 40 | 0.6% |
| 9 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 737 | 1 | |
| 725 | 1 | |
| 389 | 1 | |
| 300 | 1 | |
| 116 | 1 | |
| 115 | 1 | |
| 101 | 1 | |
| 99 | 1 | |
| 94 | 2 | |
| 84 | 1 |
Bdrms
Real number (ℝ)
MISSING 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 443 |
| Missing (%) | 6.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9258541 |
| Minimum | 0 |
|---|---|
| Maximum | 32 |
| Zeros | 18 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 32 |
| Range | 32 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.0797352 |
|---|---|
| Coefficient of variation (CV) | 0.52975355 |
| Kurtosis | 17.125512 |
| Mean | 3.9258541 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.6006406 |
| Sum | 26315 |
| Variance | 4.3252983 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2200 | |
| 4 | 1484 | |
| 2 | 1037 | |
| 6 | 869 | 12.2% |
| 5 | 371 | 5.2% |
| 1 | 242 | 3.4% |
| 8 | 215 | 3.0% |
| 7 | 90 | 1.3% |
| 10 | 53 | 0.7% |
| 12 | 44 | 0.6% |
| Other values (14) | 98 | 1.4% |
| (Missing) | 443 | 6.2% |
| Value | Count | Frequency (%) |
| 0 | 18 | 0.3% |
| 1 | 242 | 3.4% |
| 2 | 1037 | |
| 3 | 2200 | |
| 4 | 1484 | |
| 5 | 371 | 5.2% |
| 6 | 869 | 12.2% |
| 7 | 90 | 1.3% |
| 8 | 215 | 3.0% |
| 9 | 38 | 0.5% |
| Value | Count | Frequency (%) |
| 32 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 18 | 4 | 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 4 | 0.1% |
| 14 | 10 |
Fbath
Real number (ℝ)
ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4354884 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 509 |
| Zeros (%) | 7.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.71871514 |
|---|---|
| Coefficient of variation (CV) | 0.50067639 |
| Kurtosis | 1.2512878 |
| Mean | 1.4354884 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.36124013 |
| Sum | 10258 |
| Variance | 0.51655145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3409 | |
| 2 | 2885 | |
| 0 | 509 | 7.1% |
| 3 | 304 | 4.3% |
| 4 | 31 | 0.4% |
| 5 | 6 | 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 509 | 7.1% |
| 1 | 3409 | |
| 2 | 2885 | |
| 3 | 304 | 4.3% |
| 4 | 31 | 0.4% |
| 5 | 6 | 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 6 | 0.1% |
| 4 | 31 | 0.4% |
| 3 | 304 | 4.3% |
| 2 | 2885 | |
| 1 | 3409 | |
| 0 | 509 | 7.1% |
Hbath
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 404.9 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 164 |
| 3 | 6 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7146 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5183 | |
| 1 | 1793 | 25.1% |
| 2 | 164 | 2.3% |
| 3 | 6 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 5183 | |
| 1 | 1793 | 25.1% |
| 2 | 164 | 2.3% |
| 3 | 6 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5183 | |
| 1 | 1793 | 25.1% |
| 2 | 164 | 2.3% |
| 3 | 6 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7146 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5183 | |
| 1 | 1793 | 25.1% |
| 2 | 164 | 2.3% |
| 3 | 6 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7146 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5183 | |
| 1 | 1793 | 25.1% |
| 2 | 164 | 2.3% |
| 3 | 6 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5183 | |
| 1 | 1793 | 25.1% |
| 2 | 164 | 2.3% |
| 3 | 6 | 0.1% |
Lotsize
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 1670 |
|---|---|
| Distinct (%) | 23.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6676.4801 |
| Minimum | 0 |
|---|---|
| Maximum | 1341648 |
| Zeros | 489 |
| Zeros (%) | 6.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3660 |
| median | 5002 |
| Q3 | 6750 |
| 95-th percentile | 11314.75 |
| Maximum | 1341648 |
| Range | 1341648 |
| Interquartile range (IQR) | 3090 |
Descriptive statistics
| Standard deviation | 24988.764 |
|---|---|
| Coefficient of variation (CV) | 3.7428051 |
| Kurtosis | 1483.0128 |
| Mean | 6676.4801 |
| Median Absolute Deviation (MAD) | 1402 |
| Skewness | 33.886178 |
| Sum | 47710127 |
| Variance | 6.2443833 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 489 | 6.8% |
| 1 | 456 | 6.4% |
| 4800 | 419 | 5.9% |
| 3600 | 341 | 4.8% |
| 6000 | 171 | 2.4% |
| 5400 | 136 | 1.9% |
| 7200 | 135 | 1.9% |
| 5000 | 117 | 1.6% |
| 4920 | 95 | 1.3% |
| 4200 | 92 | 1.3% |
| Other values (1660) | 4695 |
| Value | Count | Frequency (%) |
| 0 | 489 | |
| 1 | 456 | |
| 75 | 1 | < 0.1% |
| 613 | 1 | < 0.1% |
| 929 | 1 | < 0.1% |
| 1050 | 1 | < 0.1% |
| 1080 | 1 | < 0.1% |
| 1084 | 1 | < 0.1% |
| 1098 | 1 | < 0.1% |
| 1120 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1341648 | 1 | |
| 835916 | 1 | |
| 788775 | 1 | |
| 429109 | 1 | |
| 409333 | 1 | |
| 388990 | 1 | |
| 306096 | 1 | |
| 277825 | 1 | |
| 261360 | 1 | |
| 243848 | 1 |
Sale_date
Date
| Distinct | 313 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 56.0 KiB |
| Minimum | 2022-01-01 00:00:00 |
|---|---|
| Maximum | 2022-12-30 00:00:00 |
Sale_price
Real number (ℝ)
| Distinct | 1284 |
|---|---|
| Distinct (%) | 18.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 271544.97 |
| Minimum | 4000 |
|---|---|
| Maximum | 21850000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 56.0 KiB |
Quantile statistics
| Minimum | 4000 |
|---|---|
| 5-th percentile | 64000 |
| Q1 | 131000 |
| median | 195000 |
| Q3 | 260000 |
| 95-th percentile | 481720 |
| Maximum | 21850000 |
| Range | 21846000 |
| Interquartile range (IQR) | 129000 |
Descriptive statistics
| Standard deviation | 770141.28 |
|---|---|
| Coefficient of variation (CV) | 2.8361464 |
| Kurtosis | 399.43511 |
| Mean | 271544.97 |
| Median Absolute Deviation (MAD) | 65000 |
| Skewness | 18.427415 |
| Sum | 1.9404603 × 109 |
| Variance | 5.931176 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200000 | 117 | 1.6% |
| 250000 | 117 | 1.6% |
| 220000 | 104 | 1.5% |
| 160000 | 101 | 1.4% |
| 225000 | 99 | 1.4% |
| 150000 | 95 | 1.3% |
| 180000 | 94 | 1.3% |
| 190000 | 93 | 1.3% |
| 175000 | 91 | 1.3% |
| 210000 | 90 | 1.3% |
| Other values (1274) | 6145 |
| Value | Count | Frequency (%) |
| 4000 | 1 | |
| 5000 | 1 | |
| 7000 | 1 | |
| 9000 | 1 | |
| 10000 | 2 | |
| 11000 | 1 | |
| 12500 | 1 | |
| 15000 | 2 | |
| 16000 | 1 | |
| 18000 | 1 |
| Value | Count | Frequency (%) |
| 21850000 | 1 | |
| 20828000 | 1 | |
| 20000000 | 1 | |
| 17400000 | 1 | |
| 17225000 | 1 | |
| 14600000 | 2 | |
| 14500000 | 1 | |
| 14450000 | 1 | |
| 14250000 | 1 | |
| 13735000 | 1 |
| PropertyID | PropType | taxkey | Address | CondoProject | District | nbhd | Style | Extwall | Stories | Year_Built | Rooms | FinishedSqft | Units | Bdrms | Fbath | Hbath | Lotsize | Sale_date | Sale_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 98461 | Manufacturing | 30131000 | 9434-9446 N 107TH ST | NaN | 9 | 6300 | Service Building | Concrete Block | 1.0 | 1978.0 | NaN | 20600.0 | 6 | NaN | 0 | 0 | 0 | 2022-04-01 | 950000.0 |
| 1 | 98464 | Commercial | 30152000 | 9306-9316 N 107TH ST | NaN | 9 | 6202 | Office Building - 1 Story | Brick | 1.0 | 1982.0 | NaN | 9688.0 | 23 | NaN | 0 | 0 | 35719 | 2022-10-07 | 385000.0 |
| 2 | 98508 | Residential | 49980110 | 9327 N SWAN RD | NaN | 9 | 40 | NaN | NaN | NaN | NaN | NaN | NaN | 0 | NaN | 0 | 0 | 1341648 | 2022-01-07 | 800000.0 |
| 3 | 98519 | Residential | 49993200 | 9411 W COUNTY LINE RD | NaN | 9 | 40 | Ranch | Aluminum/Vinyl | 1.0 | 1959.0 | 6.0 | 1334.0 | 1 | 3.0 | 1 | 1 | 83200 | 2022-08-09 | 280000.0 |
| 4 | 98561 | Residential | 50042000 | 9322 N JOYCE AV | NaN | 9 | 40 | Ranch | Aluminum/Vinyl | 1.0 | 1980.0 | 10.0 | 1006.0 | 1 | 6.0 | 1 | 0 | 8303 | 2022-05-23 | 233100.0 |
| 5 | 98593 | Residential | 50074000 | 9360 N 85TH ST | NaN | 9 | 40 | Ranch | Aluminum/Vinyl | 1.0 | 1982.0 | 5.0 | 1007.0 | 1 | 3.0 | 1 | 0 | 7200 | 2022-07-25 | 215000.0 |
| 6 | 98604 | Residential | 50085000 | 9305 N BURBANK AV | NaN | 9 | 40 | Ranch | Aluminum/Vinyl | 1.0 | 1984.0 | 5.0 | 1301.0 | 1 | 3.0 | 2 | 0 | 7200 | 2022-03-29 | 150000.0 |
| 7 | 98608 | Residential | 50089000 | 9217 N 83RD ST | NaN | 9 | 40 | Colonial | Aluminum/Vinyl | 2.0 | 2007.0 | 9.0 | 2237.0 | 1 | 4.0 | 2 | 1 | 15677 | 2022-05-10 | 400000.0 |
| 8 | 98696 | Condominium | 70017000 | 9192 N 70TH ST, Unit 2 | NORTHRIDGE WOOD LAKE | 9 | 5010 | Condo Townhouse | NaN | 2.0 | 1973.0 | 7.0 | 1437.0 | 1 | 3.0 | 2 | 1 | 0 | 2022-05-16 | 122000.0 |
| 9 | 98715 | Condominium | 70036000 | 9212 N 70TH ST, Unit 8 | NORTHRIDGE WOOD LAKE | 9 | 5010 | Condo Townhouse | NaN | 2.0 | 1973.0 | 7.0 | 1437.0 | 1 | 4.0 | 2 | 1 | 0 | 2022-04-14 | 123000.0 |
| PropertyID | PropType | taxkey | Address | CondoProject | District | nbhd | Style | Extwall | Stories | Year_Built | Rooms | FinishedSqft | Units | Bdrms | Fbath | Hbath | Lotsize | Sale_date | Sale_price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7136 | 260546 | Residential | 7160241000 | 1821 W SALEM ST | NaN | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1960.0 | 5.0 | 965.0 | 1 | 3.0 | 1 | 0 | 6000 | 2022-07-22 | 220000.0 |
| 7137 | 260559 | Residential | 7160254000 | 6507 S 17TH ST | NaN | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1960.0 | 5.0 | 1060.0 | 1 | 3.0 | 1 | 1 | 6048 | 2022-08-22 | 240000.0 |
| 7138 | 260584 | Residential | 7160279000 | 6444 S 18TH ST | NaN | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1961.0 | 5.0 | 982.0 | 1 | 3.0 | 1 | 0 | 7000 | 2022-08-30 | 195000.0 |
| 7139 | 260588 | Residential | 7160283000 | 6465 S 18TH ST | NaN | 13 | 4920 | Ranch | Aluminum/Vinyl | 1.0 | 1960.0 | 10.0 | 965.0 | 1 | 6.0 | 1 | 0 | 6093 | 2022-10-12 | 260000.0 |
| 7140 | 260630 | Condominium | 7160327000 | 1928 W SALEM ST | COLLEGE HEIGHTS | 13 | 5360 | Low Rise 1-3 Stories | NaN | 1.0 | 1974.0 | 5.0 | 1141.0 | 1 | 2.0 | 2 | 0 | 1 | 2022-11-21 | 159900.0 |
| 7141 | 260642 | Condominium | 7160339000 | 1912 W SALEM ST | COLLEGE HEIGHTS | 13 | 5360 | Low Rise 1-3 Stories | NaN | 2.0 | 1974.0 | 10.0 | 1100.0 | 1 | 4.0 | 1 | 1 | 1 | 2022-03-11 | 125900.0 |
| 7142 | 260654 | Condominium | 7160351000 | 6316 S 20TH ST | COLLEGE HEIGHTS | 13 | 5360 | Low Rise 1-3 Stories | NaN | 1.0 | 1974.0 | 5.0 | 1379.0 | 1 | 2.0 | 1 | 1 | 1 | 2022-10-28 | 150000.0 |
| 7143 | 260668 | Condominium | 7160365000 | 6376 S 20TH ST | COLLEGE HEIGHTS | 13 | 5360 | Low Rise 1-3 Stories | NaN | 2.0 | 1974.0 | 10.0 | 1100.0 | 1 | 4.0 | 1 | 1 | 1 | 2022-03-15 | 130000.0 |
| 7144 | 260669 | Condominium | 7160366000 | 6378 S 20TH ST | COLLEGE HEIGHTS | 13 | 5360 | Low Rise 1-3 Stories | NaN | 2.0 | 1974.0 | 5.0 | 1100.0 | 1 | 2.0 | 1 | 1 | 1 | 2022-12-30 | 123000.0 |
| 7145 | 260678 | Condominium | 7160375000 | 6354 S 20TH ST | COLLEGE HEIGHTS | 13 | 5360 | Low Rise 1-3 Stories | NaN | 1.0 | 1974.0 | 5.0 | 1141.0 | 1 | 2.0 | 1 | 1 | 1 | 2022-07-08 | 157500.0 |